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The paper addresses design/building frameworks for some kinds of tree- like and hierarchical struc- 
tures of systems. The following approaches are examined: (1) expert-based procedure, (2) hierarchical 
clustering; (3) spanning problems (e.g., minimum spanning tree, minimum Steiner tree, maximum leaf 
spanning tree problem; (4) design of organizational "optimal" hierarchies; (5) design of multi-layer (e.g., 
three-layer) k-connected network; (6) modification of hierarchies: (i) modification of tree via condensing 
of neighbor nodes, (ii) hotlink assignment, (iii) transformation of tree into Steiner tree, (iv) restructuring 
as modification of an initial structural solution into a solution that is the most close to a goal solution 
while taking into account a cost of the modification. Combinatorial optimization problems are considered 
as basic ones (e.g., classification, knapsack problem, multiple choice problem, assignment problem). Some 
numerical examples illustrate the suggested problems and solving frameworks. 

Keywords: hierarchy, multi-layer structure, tree, networks, ontology, information systems, solving 
strategy heuristics, combinatorial optimization, knapsack, multiple choice problem, spanning tree, Steiner 
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1. INTRODUCTION 

Hierarchies play a crucial role as a very useful model for complex systems in all domains: (1) hierar- 
chies leads to a decomposition (i.e., partitioning) of the system representation and corresponding system 
problems (e.g., design, improvement, maintenance); (2) a hierarchy is often an excellent basis for fast 
algorithms and/or solving procedures design (i.e., an analogue of linear or convex functions in continuous 
mathematics); (3) hierarchical approach is the best one for structures of information/knowledge (e.g., 
ontology); (4) hierarchical approach is the best one for problem solving strategies (e.g., decision trees); (5) 
hierarchies are are very understandable for humans and can be used as a basis of easy learning/teaching 
interactive procedures. Evidently, many years various methods have been used to design the hierarchical 



structures (e.g., p, a, m\, m, m, eg, Eg, m, m, m, m, m, urn ma, msi, nsu, 
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In the article, the following two basic problems are examined: 

I. Design approaches to design the hierarchies (including expert-based procedure, hierarchical clustering 
and spanning trees). 

II. Schemes for transformation of hierarchies (e.g., hotlink assignment, transformation of tree into 
Steiner tree, restructuring). 

Mainly, the considered problems are briefly described via a framework: (i) engineering description, (ii) 
problem formulation(s), (iii) some problem versions, (iv) basic solving schemes, (v) new prospective prob- 
lem versions (e.g., multicriteria problems, problems under uncertainty). Numerical examples illustrate 
the described approaches. 

2. BASIC HIERARCHIES AND SOME DESIGN APPROACHES 

2.1. Basic Types of Hierarchies 

Generally, it is reasonable to point out some basic types of hierarchies (e.g., [75], [95], [115] ): 

(1) various kinds of trees (e.g., Fig. 1, Fig. 2, Fig. 3) (e.g., [71 ] . [75 ] . [TT5 ] ) : 

(2) organic hierarchy (i.e., with organic interconnection among children- vertices, Fig. 4) [39 ; 

(3) "basic" hierarchy as a tree with additional edges (Fig. 5) (e.g., |122j ): 

(4) "morphological hierarchy" (e.g., [323] , [124] , [128] ) (Fig. 6); 

(5) multi-layer structures (e.g., multi-layer networks, hierarchical networks) (Fig. 7) (e.g., [T] . [16 ] . [21 ] . 
[98].[T30].[I68]1. 

Here it is reasonable to point out some important research directions in modeling of various multi-layer 
graphs/networks, for example: (a) hypergraphs (e.g., [17], [H]) and hypernetworks (e.g., [93]. |105] ); (b) 
multi-layer social networks (e.g., |109] . |152] ): (c) multi-stratum networks (e.g., |153j ): (d) multi-layer 
computer systems [196]; (e) multi-layer communications |195) : (f) multi-layer (hierarchical) information- 
communication networks (e.g., [I] . p] . [21] . [118] . pfi] . p5] . [162] . p8] . [188] ). 
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Fig. 6. "Morphological" system hierarchy ([i"28j.[i"3"0]) 
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Fig. 7. Multilayer structure 

In applied domains, many special types of tree-like structures or hierarchies are widely used, for ex- 
ample: (1) hierarchical schemes for data, for information systems (e.g., [27], |115| . [120] . [122] . [178] ): 
(2) thesauri and concept spaces (e.g., [34], [35]); (3) organizational hierarchies (e.g., [14], [61], [87], |160j . 
[202] ): (4) multi- level complex systems (e.g., |158j ): (5) phylogenetic trees (e.g., |159j . |179j . [185j ) and 
evolutionary trees (e.g., [7], [159]); (6) ontologies (e.g., @T], [99] , [164] , [201] ) ; (7) statecharts (e.g., [23], 
[97]): (8) decision trees (e.g., 0], [75], [79], [83], [84], [176], [177]); (9) hierarchy of criteria in decision 
making (Analytic Hierarchy Process) [182] : and (10) hierarchical access networks (e.g., |81]V 

Fig. 8 depicts a basic design process to obtain a hierarchical structure. 
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Fig. 8. Building of hierarchy 
It is reasonable to list our main types of hierarchy design problems as the following 



Problem 1. Expert-based design: 
Input: description of a system. 

Output, hierarchy as a result of system partitioning (tree, hierarchy, layered structure). 
Problem 2. Basic design: 

Input: set of elements, element attributes or element interconnection. 

Output: set of clusters or spanning/covering structure over the initial element set (tree, hierarchy, 
layered structure). 

Problem 3. Spanning/covering: 

Input: initial network/graph (i.e., set of elements and set of edges). 

Output: spanning/covering hierarchical structure (tree, hierarchy, layered structure). 

Problem 4. Redesign (modification, transformation, improvement): 

Input: initial hierarchical structure (tree, hierarchy, layered structure). 

Output: new hierarchical structure with some required features (tree, hierarchy, layered structure). 
Problem 5. Restructuring (special case of modification): 

Input: (i) initial hierarchical structure (tree, hierarchy, layered structure) , (ii) goal hierarchical structure 
(tree, hierarchy, layered structure). 

Output: new hierarchical structure (tree, hierarchy, layered structure) while taking into account the 
following: (a) "cheap" transformation of the initial structure, (b) "small" proximity between the new 
structure and the goal structure. 

Note, an important problem corresponds to aggregating some several initial hierarchies into a resultant 
aggregated structure (e.g., aggregation/integration/merging of information systems, data base schemas, 
catalogs, ontologies, knowledge bases, organizational structures, experts preferences) has been intensively 
examined (e.g., [3], [H], [34], [40], [48], [M5], [165], [166], [III], [207], [2T3]). An author recent survey on 
aggregation of some structures is presented in [128] . 

2.2. Expert-based 'Top-Down' Procedure 

Mainly, expert-based procedures for building a system hierarchy are based on domain experts and 
some typical "technological' frames (e.g., product life cycle as the following: design, manufacturing, 
testing, maintenance, utilization, recycling). This procedure consists of the following phases (it is the 
'divisive' strategy of hierarchical clustering): 1. dividing a system into its subsystems; 2. dividing each 
subsystem into its parts; 3. dividing each subsystem part into its components; etc. It is reasonable to 
point out the basic algorithmic rules for dividing the system (subsystem, subsystem parts, etc.): (a) 
dividing (partitioning) by physical parts, (b) dividing by system functions, (c) dividing by time stages of 
data processing. Three applied illustrative examples are presented to illustrate the procedure above: 

(1) for concrete macrotechnology (Fig. 9) ( |124j . |135] ).: 

(2) for a two-floor building (Fig. 10) ( [133] . Q37J); and 

(3) for medical treatment (children asthma) (Fig. 11) f |124) . [136] ). 
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2.3. Hierarchical Clustering (Agglomerative Algorithm) 

Hierarchical clustering consists in building a hierarchy (tree-like structure) of clusters. There is a set 
of n elements A = {Ai, Ai, A n } and a corresponding vector estimate of m attributes/parameters 



(Ti, Tj, T m ) for each element i: 



(-2i,i, Zij, Zj, m ). The basic agglomerative algorithm 



(polynomial, algorithm 1) is as follows ('Bottom-Up' element pair integration process) (e.g., [85], |191j ): 

Stage 1. Computing the matrix of element pair V(A(ii), Afa)), A(ii) G A, A{i2) £ A, ii ^ 12 
"distances" (a simple case, Euclidean distance): 



Stage2. Revelation of the smallest pair "distance" and integration of the corresponding two elements 
into a resultant "integrated" element. 

Stage 3. Stopping process or re-computing the matrix of pair "distances" and Go To Stage 2. 



G 



As result, a tree-like structure for the element pair integration process ('Bottom-Up') is obtained (one 
element pair integration at each integration step) . A basic procedure for aggregation of items (aggregation 



as average values) is as follows (Ji u i 3 = A^hA^): Vj zj i 



The item pair aggregation 



•J ~ 2 

process can be based on other functions (e.g., max, min ). Integration of several items can be considered 
analogically. An illustrative example corresponds to an analysis of 8 persons by their inclination/interests 
(Table 1). Here four attributes are used (ordinal scale [0,5]): (i) inclination for mathematics or logical 
thinking (Ki), (ii) interest to music (K2), (hi) interest to sport (-K3), and (iv) interest to trips (K4). The 
result of the basic hierarchical clustering (as a hierarchical structure for eight-person team) is depicted 
in Fig. 12. 
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Fig. 12. Example of basic hierarchical clustering 



Evidently, modifications of the agglomerative algorithms have been proposed (e.g., [65], [194] ). Our 
approach to modifications of the agglomerative algorithm above is the following |125j . 

First, computing the item "distance" based on metric l 2 is a very 'simplified' mathematical approach. 
It is possible to examine an ordinal sale for the item "proximity" . As a result, it is possible to select the 
smallest ordinal item "proximity" and integrate corresponding item pairs. 

Second, multicriteria approach for the item "proximity", (e.g., Pareto approach [169 ) can be used. 

Third, clustering can be organized as a series revelation of cliques [75] . 

In addition, it is necessary to note clustering processes can be examined as optimization problems. The 
basic approach to goal functions in clustering is based on the following: (1) inter-cluster "distances" (i.e., 
proximity between elements in the same clusters), (2) intra-cluster "distances" (i.e., proximity between 
elements in different clusters). Some other objectives can be used as well (e.g., number of clusters or 
closeness to a required interval of cluster numbers, cluster's cardinalities). Some other objective functions 
can be used as well (e.g., [lOOj . [106] )■ Thus multicriteria problem formulations based on criteria above 
may be examined as well. Generally, it may be very prospective to integrate hierarchical clustering and 
expert-based interactive procedures (previous section) for the design of system hierarchies. 
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2.4. Towards Ontology 

In recent two decades, ontology based approaches have been widely used for representation and pro- 
cessing of information in various domains, for example (e.g., [H],[7I], [SB], [95 ] . [147] . |163] . |164] . |201] ): 
knowledge-based systems, system design, systems engineering, library science, chemistry, biomedical in- 
formatics, conceptual modeling, semantic Web. Ontologies are artifacts as structures (logical, linguistic, 
"taxonomical" ) for description of a domain and/or a "space" of tasks (e.g., |164] . [2"0l"] ). In fact, hier- 
archical (multi-layer) structures are used here. Basic problems over ontologies are the following (e.g., 
[41" ] .[95 ] . [164| . [165| . |166] . [2"ul"] ): (a) design, (b) comparison, (c) integration/merging, (d) alignment. On- 
tology is often presented as the following activity over various patterns (e.g., logical, reasoning, archi- 
tectural, naming, content): searching, selecting, composing. A framework for ontology design has been 
suggested in |164) . An approach to ontology extraction was presented in [73] : 

Stage 1. Preprocessing (a preliminary work on the available documents is carried out). 
Stage 2. Creation of the first version of the ontology (as a structure) . 

Stage 3. Creation of concept and relationship. (The creation of the whole ontology extracting the 
concepts and their relations from the text documents is carried out). 

Stage 4- Harmonization. The extracted ontology is "harmonized" through the analysis of other domain 
ontologies and concept description from other systems (e.g., Wikipedia). 

Stage 5. Refinement and validation. The resultant ontology is refined and validated. 

The following main resources for ontologies design are usually pointed out: informal data structures, 
concept schemes (e.g., classifications, thesauri, nomenclatures), Web-based resources, natural language 
documents, lexical resources (e.g., dictionaries), modeling languages (e.g., UML, Petri nets). Evidently, 
modular approaches (based on modular architecture) may be widely used in the ontology design and 
ontology "life cycle" (i.e., creation, evaluation, testing, utilization, modification). 

2.5. Spanning Trees 

Evidently, spanning tree problems can be used for the design of hierarchical system models, in the case 
when a preliminary network system model over system elements exists). Many decades, spanning trees 
problems are used in applications (e.g., network design and maintenance, communication protocol design, 
VLSI design) and intensively studied in combinatorial optimization (e.g., [42], [59], [75], [82], [101j . [115] . 
[210] . |211j ). In this section, three basic spanning problems (and their modifications) are briefly described: 
(a) minimum spanning tree problem, (b) minimum Steiner tree problem, and (c) maxim leaf spanning 
tree problem. 

2.5.1. Minimum Spanning Tree and Steiner Tree Problems 

In this section, a brief description of spanning trees problems is presented (from [126] ). Table 2 
contains a list of basic spanning tree problems and corresponding literature sources. Fig. 13 illustrates 
two spanning tree problems. 



Initial graph Spanning tree Steiner tree 

(4 Steiner vertices) 




Fig. 13. Illustration for spanning trees [130 



The basic spanning problem is the minimum spanning tree or the minimum weight spanning tree problem 
(e.g., [42], [57], [72], [75], [172] ). Let G = (A,E) be a connected graph (A is the set of vertices, E is the 
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set of edges/arcs) with nonnegative weights of edges/arcs. 

A spanning tree of the graph T — (A, E') {E 1 C E) is a subgraph which is a tree and connects all 
the vertices together. The total weight (cost) of the spanning tree c(T) is the sum of weights of all its 
edges/arcs (i.e., E'). A minimum spanning tree or minimum weight spanning tree T* is a spanning tree 
with the total weight less or equal to the weights of every other spanning tree c(T*) = min/jn c(T). The 
problem is polynomially solvable (e.g., [32], [72], [75], [172] V The basic well-known polynomial algorithms 
are the following (e.g., [42], [75]: (a) Prim's algorithm, (b) Kruskal's algorithm, (c) Boruvka's algorithm. 

In more general case, a spanning structure corresponds to spanning forest: any graph (not necessary 
connected) has a minimum spanning forest which is a union of minimum spanning trees for its connected 
components, (e.g., [75], |173j ). 



Table 2. Studies in spanning trees 



Spanning problem 


Sources 


1. Spanning tree 




1.1. Minimum spanning tree 


[Tol 1701 f7£l 11 70I 

I42j,l72],[75j,ll72j 


1.2. Minimum diameter spanning tree 


k n i 
[oU| 


1.3. Minimum spanning forest 


IT^l 11 7^1 


1.4. Minimum spanning multi-tree 


i i , lUo , lyo 


1.5.Multicriteria spanning tree 


i, [33], [51], [96] 


2.Steiner tree 




2.1. Bottleneck Steiner tree problem 


[T2],[5S],[SS] 


2.2.Steiner tree problem with minimum 


[32], [58], [59], [143] 


number of Steiner points 




2. 3. Terminal Steiner tree problem 


[54] .[68]. [144] 


2. 4. Node weighted Steiner tree problem 


P].[TT3].[T5g].[2l6] 


2. 5. Prize-collecting Steiner problem 




2.6.Survivable Steiner network problem 


59 


2. 7. Steiner tree coloring problem 


m 


2. 8. Steiner tree scheduling problem 


m\ 


2. 9. Constrained Steiner tree problem 


@4],|180| 


2. 10. Steiner tree problem with hop constraints 


H,|204] 


2. 11. Steiner tree problems with profits 


m 


2. 12. Generalized Steiner problem 


[2] .[TT]. [56]. [57] .[209] 


2. 13. Generalized Steiner star problem 


1112] 


2. 14. Stochastic Steiner tree problem 


mm 


2. 15. Dynamic Steiner tree problem 


1102] 


2. 16. On-line Steiner tree problem 


0,[H],[2O3] 


2. 17. Group Steiner tree problem 


ES],[6Q] 


2. 18. Steiner forest problem 


m 


2.19.Multicriteria Steiner tree problem 


|140|.|141|.|206] 


2.20.Multicriteria Steiner tree problem 


urn 


with the cost of Steiner vertices 





In recent years, multicriteria spanning tree problems (or multi- objective spanning tree problems) are 
examined (e.g., [9], [33], [51], [96]). Here a vector-weight corresponds to each graph edge/arc and the 
objective function is a vector-like one where each component of the vector above equals a sum of cor- 
responding components of the edges/arcs weights: c(T). In this case, it is reasonable to search for 
Pareto-cfhcicnt (by c(T)) solutions. This problem is NP-hard (even in bicriteria case) (e.g., [9"] . |33] ) . 
Here the following algorithms are used (e.g., [9], [33]): (i) multicriteria Prim's spanning tree algorithm, 
(ii) multicriteria Kruskal's spanning tree algorithm, (iii) genetic algorithms, (iv) multiobjective evolu- 
tionary optimization algorithms, (v) heuristics (e.g., GRASP algorithm, simulated annealing algorithm), 
and (vi) knowledge-based approaches. 

In the case of Steiner tree problem (e.g., [2J , [H] , [55] , [57J , [SI] , [73] , [52] , [TUT], [H3] , [IHFJ , [102] [2TUJ ) , the 
spanning tree can include additional vertices (i.e., Steiner vertices). Thus, the total weight (cost) of the 
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resultant spanning structure can be less than in the case of the basic spanning tree problem. The basic 
formulation of Steiner tree problem is the following. Given an undirected connected graph G — (A, E) 
with node/vertices set A, edge set E, and nonnegative weights associated with the edges. Given a set Q 
of specified vertices (terminals/basic vertices, i.e., Steiner points). The problem is: 

Find a minimum cost subgraph T s — (Q,E') C G (E' C E) such that there exists a path in the 
subgraph T s between every pair of basic vertices. 

Here the optimal solution T s is a tree (Steiner tree). The Steiner tree problem is NP-hard. Descriptions 
of many Steiner tree problem formulations are presented in |82j . Mainly, the following solving approaches 
are used for the Steiner tree problems: (1) exact algorithms (enumerative algorithms, for example, branch- 
and-bound, branch-and-cut) (e.g., [37], |146j ) and (2) various heuristics (e.g., [44], [203], [205] ) including 
the following: (a) fast (greedy) algorithms (e.g., [3"T] . [H]); (b) approximation algorithms (e.g., 0, [5"9"] . 
[EH], [S], [HU, Q33], [3T6]); (c) genetic algorithms (e.g., [TOS]); (d) Al-based methods (e.g., [TOT]); (e) dual 
heuristics (e.g., [H5]); (f) distributed primal-dual heuristic (e.g., [187] ): and (g) local search techniques 
(e.g., |22|). 

In multicriteria Steiner tree problems (e.g., |140) . [T4Tj . |206j ) a vector weight for each edge/arc is under 
consideration, and the vector-like objective function (as a vector of corresponding sums) can be used (in 
a simplest case): c(T s ). Thus, Pareto-efficient solutions (by c(T s )) are searched for (e.g., |140j . [T4Tj ). 
Here, the following algorithmic approaches can be pointed out: (i) basic heuristics (e.g., [206 ); (ii) 
special multi-layer macro-heuristics: (a) partition-synthesis heuristic (e.g., [123) b (b) spanning-tree based 
heuristic (e.g., |140j ). (c) composite multistage solving scheme (e.g., |141j ). 

2.5.2. Maximum Leaf Spanning Tree Problem 

The "maximum leaf spanning tree" problem is the following (e.g., [6], |75) . [114] ): 

Find a spanning tree of an input graph so that the number of the tree leafs is maximal. 

Generally, the spanning tree of a graph contains the following types of nodes: (a) root, (b) internal 
nodes (the internal nodes may be considered as a virtual "bus" in networking), and (c) leaf nodes. Thus, 
the problem consists in maximizing the number of leaf nodes or minimizing the number of internal nodes. 
The problem is one of the basic NP-hard problems [75 . Fig. 14 depicts an illustrative numerical example. 

Initial graph Spanning tree 




Fig. 14. Illustration for maximum leaf spanning tree 

The problem is applied in networking (e.g., minimum-energy broadcast trees in ad hoc wireless net- 
works), (e.g, [142] . [199] ). in circuit layout [193 . The following main types of algorithms have been 
suggested: (1) exact algorithms (branch-and bound) (|BB], [70]), (2) local optimization |148] . (3) greedy 
3-approximation algorithm (linear time) |149j . (4) 2-approximation algorithm [192) . and (5) heuristics 
(e.g., Bee Colony algorithm) (e.g., [55], [190j ). Some prospective generalizations of the "maximum leaf 
spanning tree" problem may be considered while taking into account the following: (a) multicriteria 
descriptions of leaf nodes, (b) uncertainty, (c) dynamics. 

In sense of exact algorithms, this problem is equivalent to "connected dominating set" problem (e.g., 
El], [30], [36], [75]): 
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Find a minimum set of vertices D C A of input graph G = {A, E) that the induced by D subgraph 
G' = (D,E') (E' C i?) is connected dominated set and £) is a dominating set of G. 

A recent survey on the problem is presented in 22 . The "connected dominating set" problem plays 
the central role in sensor wireless networks, in mobile ad-hoc networks (MANETs), in network testing 
(e.g., [22], [47], [142] . |199] V This problem is used in communication protocols including the following 
[22] : (i) media access coordination, (ii) unicast, (hi) multicast/broadcast, (iv) location-based routing, 
(v) energy conservation, (vi) topology control, and (vii) resource discovery in MANET. Mainly, the 
following algorithms are used for the problem: (i) approximation algorithms (e.g., [35], [EE])) (ii) heuristics 
(e.g., [28 ,[186 ). Analogically, prospective generalizations of the problem may involve the following: (a) 
multicriteria descriptions of the elements of the dominating set, (b) uncertainty, (c) dynamics. 

2.6. Towards Optimal Organizational Hierarchies 

Organizational hierarchies have been investigated many decades (e.g., [14] , [50] . [87], |154| . [160] . [197] . 
202 , [208 ) . The following basic approaches to organizational structures are used in firms (manufacturing, 
sales, etc.): bureaucratic structures, functional structures, division structure (or product structure), 
matrix structure (integration of functions and products). Generally, the following hierarchical layers are 
considered for organizations: (a) organization, (b) branches, (c) departments (divisions), (d) workgroups, 
(e) individuals. Fig. 15 depicts an example of hierarchical structure for universities. 
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Fig. 15. Example of hierarchical structure (university) 



In recent years, interests to design of 'optimal' organizational hierarchy have been increased (e.g., [87] . 
160 , 202 ). Here, approaches can be based on spanning structures (see previous section) or direct design. 
Two basic kinds of direct design problems may be examined (e.g., [122] . [126] ): 

1. to build a hierarchy that has the best value(s) of its property (ies); 

2. to build a hierarchy that is the most 'close' to an ideal one (or a set of ideal structures). 

Let {Hi — (Ai,Ei), i — 1, n} be a set of initial structures (hierarchies). Let H 9 (A g , E g ) (H 9 £ {Gi}) be 
a goal (i.e., ideal) structure, let p(Hi 1 , Hi 2 ) be a proximity (or 'distance') between two structures (hierar- 
chies) Hi 1 and Hi 2 (Vi/^, Gi 2 £ {Hi}). In addition, it is reasonable to consider properties of a structure 
(e.g., vertex degree, vertex connectivity) H ix £ {Hi}: ip(H tl ) = (t/j^HiJ, ?/> M (-ff H ),..., ipmiH^ )) . 
Here hierarchy H* £ {Hi} is searched for. Thus, in the 1st kind of the problem the objective function is 
(a case of maximization for each property): 



max ib,,(H*) Vu = l,m. 
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This problem is close to the well-known class of "graph augmentation" problems, i.e., modification of a 
graph to get some required properties of the modified graph (e.g., [63 ,[111 ). 

In the second kind of the problem the objective function is: min^.g^ p(H* , H 9 ). In general, 
multicriteria problem formulations may be examined as well. Clearly, it is possible to consider integrated 
problem formulations, for example: 

min p(H*,H 9 ) s.t. ip^H*) > # V \i = LTm, 

where (d 1 , gP, d m ) is a vector constraint for properties. The optimization models above usually 
correspond to complicated integer (or mixed integer) optimization problems and here various solving 
approaches are used (e.g., enumerative methods, heuristics, AI techniques). 

In addition, it is necessary to point out that an attempt to build generalized approach for 'optimal' 
organization hierarchy is presented in ([57], |160j . |202j ). This approach is based on the usage of a 
functional P of hierarchical structure G € SI (SI is the set of hierarchical structures as direct acyclic 
'layered' graphs; P : SI — > [0, +oo)) and problem is: argmincgn P{G). Mainly, structure G is 
considered as tree and functional P is considered as convex (|87j. |160j . |202j ). The set of applications 
involves technological process, supply chain network, etc. 

2.7. Multi-layer Structures 

2.7.1. Multi-layer Approach 

Multi- layer (or multi- level) approach is a basic one for representation of complex systems (e.g., [130] . [152] . 
158 ,[195 ,[196 ). In fact, this approach is a basic methodological method for decreasing the system com- 
plexity (i.e., multi-layer partitioning a systems and corresponding partitioning the system problems set). 
Here, the following main methodological steps can be pointed out. First, levels for main properties of 
complex systems have to be considered, for example: (i) stability, (ii) controlability, (iii) adaptability, 
and (iv) self-organization |158j . Second, seven-layer structure for computer systems was suggested as 
follows (e.g., |196j ): (1) layer of hardware, (2) layer of microprogramming, (3) layer of operation system, 
(4) assembler-based layer, (5) layer of algorithmic languages, (6) layer of applied support systems (e.g., 
DBMS, DSS), and (7) layer of applications. Third, seven basic layers (OSI model) for data transmission 
in communication networks were suggested (e.g., |195j . [215] ): (1) physical layer (media, signals and bi- 
nary transmission), (2) data link layer (physical addressing), (3) network layer (path determination and 
logical addressing), (4) transport layer (end-to-end connections, reliability and flow control), (5) session 
layer (interhost communication, managing sessions between applications), (6) presentation layer (data 
presentation, encryption and decryption, convert machine dependent data to machine independent data), 
and (7) application layer (network process to application). 

Evidently, the multi-layer approach can be applied in many domains. For example, four-layer structure 
was suggested for representation of combinatorial "optimization problems domain" in |130j : (i) layer of 
basic combinatorial optimization problems, (ii) layer of multicriteria combinatorial problems, (iii) layer 
of typical composite problem frameworks, and (iv) layer of typical applications. 

In general, a multi-layer system hierarchy consists of the following: (a) hierarchical layers; (b) set of 
elements for each hierarchical layer, description of the elements (i.e., attributes); (c) interconnections 
(some relations) over the set of elements for each hierarchical layer; (d) connections between elements of 
neighbor hierarchical layers. Thus, the following design framework for a multi-layer system hierarchy can 
be considered: 

1. Generation of multi-layer structure (i.e., the layer), for example: (i) dividing the initial ele- 
ments/nodes into parts corresponding to layers (levels), (ii) description of layer elements (nodes, arcs) 
(e.g., traffic), (iii) building a structure for each part, e.g., path, multiple paths, tree, ring, complete graph 
(i.e., clique) or their combinations. 

2. Definition (searching for) connections between elements of neighbor hierarchical layers. 

2.7.2. Typical Hierarchical Layers in Communication Network 

First, it is reasonable to describe a typical hierarchy of communication networks (e.g., [45], [76], |118| . 
[162] ). A traditional network hierarchy consists of the following basic layers: (a) international (multi- 
country, continent) network GAN; (b) metropolitan network MN; (c) wide area network WAN; and 
(d) local area network LAN. IBM Red Book contains an interesting dimensional classes of networks 
by node numbers as follows: (i) large size communication network ( > 500 nodes ); (ii) medium size 
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communication network (< 500 nodes); and (iii) small size communication network (< 80 nodes) [162) . 
From the "engineering" viewpoint, the following hierarchical layers can be considered: 

1. Backbone network. 

2. Global network as a set of interconnected network segments including the following: (a) additional 
centers, (b) cross-connections, and (c) bridges. 

3. Access network / network segment (cluster): (e.g., bi-connected topology, about 20 nodes). 

4. Distributed network: a simple hard topology (e.g., bus, star, tree, ring). 

As a result, a class of small-dimensional networks is added to the above-mentioned classification: 20. ..25 
nodes. Here a simplified network hierarchy is examined as follows (Fig 16): 

1. TOP LAYER: nodes and links for connection of clusters: 1.1. basic node clusters (network 
segments), 1.2. communication centers, 1.3. cross-connection links (center's connection), and 1.4. links 
for connection of neighbor clusters. 

2. MEDIUM LAYER: basic clusters (network segment/ access networks, bi-connected topological 
modules). 

3. BOTTOM LAYER: distribution networks (e.g., tree, ring, bus). 
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computing 
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Fig. 16. Illustration for multi-layer communication network 



2.7.3. Layered k-connected Network 

A special version of 2-connected network was suggested in |16j . The generalized fc-connected network 
of this kind and its design scheme were briefly described in [130] . Let G = (A, E) be an initial graph 
(network) where A is a vertex (node) set, E is an edge set and \A\ > k x (fc + 1) + k. 

Our kind of /c-connected networks is: (a) k "centers" where each "center" is (k + l)-vertex clique 
(the "centers" have not intersections), (b) node set M C A ( \M\ > k ) includes all nodes which do not 
belong to the "centers"; there is a connection (i.e., edge) between Va 6 M and each "center", i.e., k edges 
(one edge for a connection to a "center"). As a result, the following three-layer network is obtained (Fig. 
17): (i) layer of end nodes, (ii) layer of special "central" communication nodes: selected nodes and/or 
special additional nodes (the nodes are useful for location of key communication equipment); and (iii) 
communication "center" (the "center" can correspond to "communication providers" / "communication 
operators" or their branches). 

Fig. 18 depicts an example of 4-connected structure. Proof of ^-connectivity for the defined kind of 
networks is based on four special cases. Consider two nodes a, b e A. The cases are as follows: 
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Case 1: a and b belong to the same "center" (i.e., (k + l)-clique). Nodes a and b have connections 
as follows: (i) direct connection (a, b), (ii) (k — 1) two-edge connection by other nodes of this "center". 
Thus, nodes a and b have k connections (without intersection). 

Case 2: a belongs to "center" A and b belongs to "center" B (i.e., another one). Nodes a and b have 
connections as follows: two-edge connection by another node a e M (i.e., (fc)-connections of this kind). 
Thus, nodes a and b have k connections (without intersection). 

Case 3: a belongs to a "center" and b € B. Nodes a and b have connections as follows: (i) direct 
connection (a, b), (ii) (k — 1) three-edge connection: by a node of each other "center" and by another 
node from M ((fc — 1) different ways). Thus, nodes a and b have k connections (without intersection). 

Case 4: a,b E M. Nodes a and b have k different connections as follows: node a, a "center", node b. 



Layer 3: communication "centers" 

it it IT " 

Layer 2: special access nodes 



Layer 1: basic set of end users 



Fig. 17. Three-layer model of network 



Center" 1 "Center" 2 "Center" 3 "Center" 4 




End user 

Fig. 18. Illustration for 4-connected network 



Further, a solving scheme ('Bottom-Up') to build fc-connected structure is: 

Stage 1. Selection of k x (k + 1) vertices for k "centers" (here multicriteria ranking problem can be 
used). 

Stage 2. Clustering of the selected vertices to get k clusters as "centers" (each cluster consists of (fc + 1) 
vertices, the clusters have not intersections). 

Stage 3. Building of connections for end users: connection of each vertex (that does not belong to 
a "center") with each "centers" (i.e., with the only one node of each "centers"). Here multiple choice 
problem or its modifications can be used. 

Now an illustrative example of illustrate the examined type of bi-connected communication network is 
presented. Initial nodes are depicted in Fig. 19. 
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Fig. 19. Initial network nodes 
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Here the following nodes are selected for "centers": {3,5,9,10,12,14}. Other nodes correspond to 
users. 

The corresponding version (version 1) of the resultant bi-connected networks is presented in Fig. 20: 
Version 1. "Centers" are locally-allocated (i.e., regional "centers"): "center" 1: nodes 3, 10, and 12; 
"center" 2: nodes 5, 9, and 14. 



"Center" 1 



Center" 2 




Fig. 20. Bi-connected network: locally-allocated "centers 

On the other hand, it may be reasonable to consider another situation with "simplification" of connec- 
tions for end users (e.g., minimization of connection distances). In this case, it is necessary to use the 
following algorithmic rule at stage 2 of the above-mentioned solving scheme (i.e., grouping of the selected 
nodes to centers): 



Nodes of each "center" have to 'cover' the network (e.g., to be very close to all nodes of end users). 



Thus, the second solving scheme to build fc-connected structure is: 



Stage 1. Clustering of of the initial set of nodes to get k clusters. 

Stage 2. Building of k "centers" : selection of a vertex in each obtained cluster as an element for each 
"center" (a representative of the "center" ) , connection of the vertices in each "center" . 

Stage 3. Building of connections for end users: connection of each vertex (that does not belong to a 
"center") with each "centers" (i.e., with the only one node of each "centers"). 

Fig. 21 depicts an example of this type of the resultant bi-connected network: 

Version 2. "Centers" are distributed over the network: "center" 1: nodes 3, 9, and 12; "center" 2: 
nodes 5, 10, and 14. 



"Center" 1 "Center" 2 




Fig. 21. Bi-connected network: distributed over network "centers' 



2.7.4. Towards Hierarchical Network Design Problems 

The basic hierarchical two-level network design problem consists in finding a minimum cost two-level 
spanning network, consisting of two parts: (i) main path (or several paths, tree, ring) (ii) secondary trees 
(e.g., [45], [167] . |168j . |175j ). Thus, the initial network is divided into two parts: 
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1. The higher level part (primary nodes, main path): a path (or several paths, tree, ring) composed of 
primary arcs, which visits some of the nodes of the network (i.e., primary nodes). 

2. The lower level part (secondary nodes, secondary trees): the part is composed of one or more trees 
whose arcs, termed secondary, are less expensive to build than the primary arcs. 

Here, each arc has a cost (ckj, <E A, A is the set of nodes). The total cost of the selected arcs 
in the spanning structure is used as the minimized objective function. The problem is formulated as 
combinatorial optimization model (e.g., [45]), it is NP-hard [13]. Various approaches have been suggested 
for the problem, for example: (a) exact (enumerative) methods (e.g., branch-and-cut approach, dynamic 
programming), (b) heuristics (e.g., Lagrangian relaxation) , (c) evolutionary algorithms. Often, a prelim- 
inary minimum spanning tree is used to construct the solution (leaf nodes of a spanning tree are used as 
the secondary nodes). Further, three illustrative examples are presented: 

(i) higher part is path over primary nodes < 1, 6, 5, 7, 2, 3, 4 > (Fig. 22), 

(ii) higher part is tree over primary nodes {1, 2, 3, 4, 5, 6, 7} (Fig. 23), 

(hi) higher part is ring over primary nodes {1, 2, 3, 4, 5, 6, 7} (two-connected case) (Fig. 24). 

Main applications of the problem involve communication networks, computer networks, transportation 
networks, power line distributed systems. Evidently, more general multi-level networks are examined as 
well (e.g., [13], [38]). 




Fig. 24. Example of two-level network (higher level: ring) 
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2.7.5. Connection Assignment in Two-layer Network (Access Points - Users) 

This section focuses on assignment design for two-layer network (access points - users) (Fig. 25). 
The design problem consists in connection (assignment) of each user to access point. 



Located access points Q = {1, ■ ...,to} 



I 



Connections under assignment 



t t t t t t t 

Located end users W = {1, i, n) 



c 



Fig. 25. Layers: access points - users 
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The problem and initial data are partially based on [138 . The following parameters are used: set of 
users 4' = {1, i, n} (n = 25), set of access points 9 = {1, i, m} (m = 6). Each user is described 
by parameter vector Zi, fi,Pi), where vector components are as follows (Table 3): coordinates of 

user (xi,yi,Zi, required frequency bandwidth f t (scale: 1 Mbit/s ... 10 Mbit/s), priority pi (ordinal scale 
[1,2,3], all user requirements are satisfied in case Pi — 1), required reliability fj (ordinal scale [1,10], 10 
corresponds to maximum reliability). Analogically, parameters of access points are considered (by index 
j, Table 4) including parameter rij (maximal possible number of users under service). 



Table 3. Data on users |127j 



i 




Vi 


Zi 


fi 


Pi 


n 












Vi 




fi 


Pt 




1 


30 


165 


5 


10 


2 


5 


i 


Zi 


n 


2 


58 


174 


5 


5 


1 


9 


14 


110 


169 


5 


7 


2 


5 


3 


95 


156 





6 


1 


6 


15 


145 


181 


3 


5 


2 


4 


4 


52 


134 


5 


6 


1 


8 


16 


170 


161 


5 


7 


2 


4 


5 


85 


134 


3 


6 


1 


7 


17 


120 


140 


6 


4 


2 


6 


6 


27 


109 


7 


8 


3 


5 


18 


150 


136 


3 


6 


2 


7 


7 


55 


105 


2 


7 


2 


10 


19 


175 


125 


1 


8 


3 


5 


8 


98 


89 


3 


10 


1 


10 


20 


183 


91 


4 


4 


3 


5 


9 


25 


65 


2 


7 


3 


5 


21 


135 


59 


4 


13 


3 


4 


10 


52 


81 


1 


10 


1 


8 


22 


147 


79 


5 


7 


3 


16 


11 


65 


25 


7 


6 


2 


9 


23 


172 


26 


2 


10 


2 


7 


12 


93 


39 


1 


10 


1 


10 


24 


165 


50 


3 


7 


3 


3 


13 


172 


26 


2 


10 


2 


7 


25 


127 


95 


5 


7 


2 


5 



Table 4. Data on access points [127 
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Further, each pair "user-access point" (i.e., £ $F,j £ O) can be described: (1) reliability 

= min{ri,rj}, (2) distance kj, (3) priority ptj — pi, and (4) required bandwidth = fi. In 
addition, a "connection" parameter is considered: equals 1 if Uj < I and otherwise (L corresponds 
to distance constraint). This parameter defines Vi £ a subset of possible access points 0; C 0. The 
assignment of user i to access point j is defined by Boolean variable Xij (xij = 1 in the case of assignment 

2 From (with amendments): 

M.Sh. Levin, Towards communication network development (structural systems issues, combinatorial problems). IEEE 
Region 8 Int. Conf. Sibircon 2010, vol. 1, (2010) 204-208. 
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i to j and — otherwise). The assignment solution ("J => 9) is defined by Boolean matrix 
X = \\xij\\, i = l,n, j = l,m. The problem formulation is: 

n n n 

max^ ^2 r i] x i], max^ ^ fijXij, max ^ Py^ij 
i=i ieOi i=i jee; »=i jee; 

n n 
i=l i=l j<£0; 

iij6{0,l}, V i = l,n, V j = l,m; Xy ■ = 0, V i = l,n, j G {6 \ 6,}. 

This multiple criteria assignment problem is NP-hard (e.g., |75|). Here, a simplified two-stage heuristic 
is used: (i) transformation of vector estimate for each pair into an ordinal estimate (by multicri- 

teria ranking, ELECTRE-likc technique f [134] . |181j ). (ii) solving the obtained one-criterion assignment 
problem (by greedy algorithm). Fig. 26 depicts the obtained solution: assignment of users to access 
points. 




Fig. 26. Assignment: users - access points |127] 

2.8. Morphological Hierarchy 

Morphological hierarchy generalizes system model from morphological analysis [217] by the following 
ways ( |123] . [P24] . |126] . |128] . |131] . |133] ): (i) hierarchical (tree-like) structure of the examined system; (ii) 
design alternative set for each leaf node (system component) of the system model; (iii) assessment of 
the design alternatives (DAs) on the basis of an ordinal scale (e.g., |123j . |12j] . |126] ) or a special interval 
multiset based scale [133] : and (iv) ordinal compatibility between the design alternatives for different 
system components (instead of binary compatibility that was used in morphological analysis) . 

An example of morphological hierarchy for a compressed plan of car repair from [132] is the following 
(Fig. 27, ordinal estimates of DAs are shown in parentheses; Table 5, Table 6): 

0. Plan S — A* B * C 

1. Payment A = X * F 

1.1. payment scheme X: 100 % payment Xq, prepayment of 50. ..80 percent for parts X\] bank loan 

1.2. version F: cash Fi, credit card F2, bank transfer F3. 

2. Body B = R* Z * M: 

2.1. frame R: None R , technical diagnostics R\, follow-up assembly i? 2 ; 
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2.2. hardware Z: None Zo, replacement of defect parts Z\, repair of body-defects Z2, fitting Z 3 , 
Z4 = Z\!k,Z2, Z§ = Zi&cZs, Zq = Z2&Z3, Z-j = Z\izZ 2 h,Z 3 \ 

2.3. finishing M = U * V: 

2.3.1. painting U: None Uo, partial painting U\, painting U 2 ; 

2.3.2. appearance restoration V: None Vq, Yes V\. 
3. Electric & electronic subsystem C = P * Q: 

3.1. Computer & navigation subsystem P = K * G 

3.1.1. Computer K: None Kq, upgrade K\, additional or new computer K2; 

3.1.2. system GPS G: None G , GPS system d; 

3.2. wiring & lighting Q = O * L 

3.2.1. wiring O: None O , repair 0\; 

3.2.2. lighting L: None Lq, partial replacement L\, replacement L 2 . 
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Fig. 27. Structure of repair plan |132j 
Table 5. Compatibility [T32] 
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Table 6. Compatibility [132] 
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The evident design scheme for morphological hierarchy is the following: 
Stage 1. Design of system hierarchy (one of the methods above). 

Stage 2. Generation of DAs for each leaf node of the system model (expert judgement and/or usage of 
information bases). 
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Stage 3. Generation of scales for assessment of DAs and assessment of DAs (usually multicriteria 
description of DAs is used at a preliminary phase). 

Stage 4- generation of an ordinal scale for compatibility among DAs and assessment of the compatibility. 



3. SOME APPROACHES TO MODIFICATION 

3.1. Modification of Tree via Condensing of Weighted Edges 

This section describes transformation of a tree (with weights of vertices and weights of edge/arcs) via 
integration (condensing) of some neighbor vertices while taking into account a constraint for a total weight 
of the maximum tree tail (i.e., length from root to a leaf vertex). The problem was firstly formulated for 
designing an overlay structure of a modular software system in [121] . The integration of software modules 
requires additional memory, but allows to decrease a time (i.e., frequency) of loading some corresponding 
modules. Other applications of the problem can be examined as well, e.g., hierarchical structure of data, 
call problem, hierarchical information structure of Web-sites. This problem is illustrated in Fig. 28 and 
Fig. 29 by an example for designing the over-lay structure on the basis of module integration, when 
different software or data modules can apply the same parts of RAM. A new kind of FPTAS for the 
above-mentioned combinatorial optimization problem (a generalization of multiple choice problem over 
a tree-like structure and special constraints) was suggested in |121j . 
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Fig. 28. Integration of software modules (over-lay structure) 
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Fig. 29. Usage of memory (RAM) 



Let G — (A, T) be an oriented tree, where A is a set of vertices (software or data modules). T is a 
multi-valued mapping of A into A. Arcs of G are oriented from the root a Q G A to leaf vertices. Each 
vertex a G A has a positive weight (required volume of RAM) (3(a) > 0. Each arc (a ,a ) (a ,a G A 
a G Ta ) has a weight (i.e., an initial frequency of loading into RAM) w(a , a ) > 0. This arc weight 
corresponds to the frequency of calling (and loading) from module a to module a . 

3 From (with amendments): 

(i) M.Sh. Levin, An extremal problem of organization of data. Eng. Cybern., 19(5), 1981, 87-95. 

(ii) M.Sh. Levin, Combinatorial Engineering of Decomposable Systems, Springer, 1998, Chapter 2. 
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Let w(a 1 ,a l ) —< a 1 , a 4 , a 1 > be a path (a J+1 6 Ta J , j = 1, — 1). We propose for each path a 
weight X(n(a 1 ,a l ) — J2i=i A(a l ). Denote by a weight of graph G the value 

X(G) = max a " eAo {X(ir(a ,a ))}, 

where A° = {a e A Ta = 0} is a set of leaf vertices. Let G a = (A a , T) is a subtree with root a e A, 
and A a contains vertex a and all other vertices, which can be reached from a. Graph (A a \ a, T) is called 
tail of vertex a, and value A - (a) = X(G a ) — A(a) is called a tail weight of vertex a. Clearly that 

X(a)=max a > €Ta {X(G a >)}. 

We examine weight w(a) and binary variable x(a) Va e A\a a (1 corresponds to a situation when the arc, 
directed to a, is condensed). Now let us define a transformation of graph G on the basis of integrating 
the vertices a and a as follows: 

(a) vertex a is changed into J{a , a ) with the following properties: 
X(J(a',a")) = A(a') + A(a") TJ(a',a") = (Ta (J Ta") \ a"; 

(b) vertex a and arcs, which are oriented from the vertex, are deleted. 

For graph G, we propose a binary vector n(a) that involves all x(a) Vaei\ a D . Thus, we examine the 
weights of vertex a and its tail as functions of vector n: A(a, /c), A~(a, n). Now let us consider a problem 
(kind 1): 

max W(k) = x(a)w(a) s.t. A(a D , k) + A - (a Q , k) < b, 

a£A\a 

where b is a positive constant (i.e., a volume of accessible RAM). In general, this problem formulation 
corresponds to the example in Fig. 28 and Fig. 29. 

In addition, we examine analogical problem (kind 2) with other constraints as follows: 

A(a ,«;)<fr~, X~(a ,K)<b + , b~+b + =b. 

Note, illustrations of the class of considered combinatorial problems are presented in Fig. 30 (basic 
knapsack problem and multiple choice problem) and in Fig. 31, correspondence of problem to illustration 
is pointed out in Table 7. 

Now consider some simple cases of the problems (kind 1 and kind 2). Let Ta„ — {a\, a^, a m } (and 
Ui) corresponds to an arc (a , a^) (w(ui) =Wi). Then, corresponding problem (problem 1, an equivalent 
to knapsack problem, Fig. 30i) is: 

m m 

max ^^XiWi s.t. A(a D ) + ^ XjA(aj) < b, ^£{0,1}. 

i=l i=l 

The objective function in other simple cases (1.1 - Fig. 31a, 1.2 - Fig. 31b, 1.3 - Fig. 31c, 1.4 - Fig. 
31d), which are based on knapsack problem (problem 1, Fig. 30i) is analogical, and only constraints will 
be presented for them. 

Problem 1.1 (Fig. 31a) has the following constraint: 

m 

X(a a ) + XiX(di) + max ((1 - x j )A(aj)) < b. 

— ' Ki<m 
i=l 

This problem corresponds to a "kernel" load in many software packages. 
Problem 1.2 (Fig. 31b) of kind 2 is the following: 

m 

X(a ) + / XiX(cii) < max ((1 — XjA(aj)) < b + . 

^— ' Ki<rn 
i=l 

Problem 1.3 (Fig. 31c) is: 

m 

A(a D + XiX~(ai) + max ((1 - x { X~ (ai)) + X + (ai)) < b. 

' Ki<m 
i=l 
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It is reasonable to point the following properties of this problem: 

(a) a, (Vaj € Ta ) has the weight A~(aj); 

(b) (Va^ S Ta D ) has the only one son with a weight \ + (a,i), and the value is the tail weight; and 

(c) only condensing the following arcs (a ,ai) (i — 1, m) is admissible. 

As a result, a sequence of simple problems based on knapsack problem can be examined: 1 (basic 
knapsack problem, Fig. 30i), 1.1 (analogue of knapsack problem, Fig. 31a), 1.2 (Fig. 31b), 1.3 (Fig. 
31c), 1.4 (Fig. 31d). 

In the same way, a sequence of auxiliary problems based on multiple choice problem can be considered: 
2 (basic multiple choice problem, Fig. 30ii), 2.1 (analogue of multiple choice problem, Fig. 31e), 2.2 (Fig. 
31f), 2.3 (Fig. 31g), 2.4 (Fig. 31h). 

In our case, multiple choice problem or problem 2 (Fig. 28ii) is the following: 

m qi m qi qi 

max W({xij}) = y^ j y^ j w(a i j)x i j s.t. X(a ) ^ ^ x v] ;A(ay) < 6, ^aJy = 1, i = l,m; xtj 6 {0,1}. 

i=l j=l i=l j=l j=l 

Here, the following set of Boolean vectors in auxiliary problems is used: 
X = {k = (xjpXi^lxjpX^ £ {0, 1}; j = l,q l ; i = l,m} 

In addition, the following constraint has to be taken into account in all auxiliary problems: 

qi 

i=i 

Also, the following modified objective function is used: 

m qi 

i=i j=i 

Now, for example, auxiliary problem 2.4 is considered that corresponds to kind 2 above (Fig. 31h): 

m qi 

A(a Q ) +^2^2^ j X~(a ij ) <b~, max((l - x^)X~(aij) + A + (a y )) < 

For the sequence of simple problems above, we can apply approximation algorithms, which are based 
on an e-approximate algorithm (e e [0, 1]) for knapsack problem (e.g., |110j . |156j . |183j . |184j ). In the 
case of these algorithms, an estimate of an operation number is similar for knapsack problem (e.g., [1 lOj . 
[156] ). and equals 0{^) |121j . The algorithms apply ordering of elements from set Ta by non-decreasing 
of \(a,i) or (A~(aj) + A+(ai)). 

The solving process of auxiliary problems is based on similar approximation approach to multiple choice 
problem with the following estimates of number of operations and required memory accordingly (e.g., 
|110j . |156j ): 0(y Ym=i Qi)> 0{~ max i<i<m{'?i})- Unfortunately, we could not construct an algorithm 
with similar estimates for the auxiliary problem 2.3 (Fig. 31g) |121j . As a result, (e, (J)-approximate 
algorithms with the following estimates (number of operations, and required memory) were suggested 
[121] : 0(^ J27=i 0(^- maxi<j< m {gi}), where S is a relative error for constraints. 




(i) knapsack problem (1) (ii) multiple choice problem (2) 



Fig. 30. Illustration for knapsack and multiple choice problems |123] 
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(d) kind 2 (1.4) (h) kind 2 (2.4) 



Fig. 31. Illustration for simplest and auxiliary problems 123 



Table 7. Knapsack-like problems - illustrations 



Problem 


Illustration 


4. Basic knapsack problem 


(problem 


1) 


Fig. 


30i 


5. Problem 1.1 






Fig. 


31. a 


5. Problem 1.2 






Fig. 


31.b 


5. Problem 1.3 






Fig. 


31.c 


5. Problem 1.4 






Fig. 


31.d 


4. Multiple choice problem 


(problem 


2) 


Fig. 


30ii 


5. Problem 2.1 






Fig. 


31.e 


5. Problem 2.2 






Fig. 


31.f 


5. Problem 2.3 






Fig. 


31.g 


5. Problem 2.4 






Fig. 


31.h 



When G is a fc-level tree, the algorithm is based on cascade- like 'Bottom- Up' process (Fig. 32) |121j : 
Step 1. Problem 1.2. 



Step j (j = 2,k- 2). Problem 2.4 
Step (k-1). Problem 2.3. 
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Estimates of the algorithms are as follows (i.e., operations, and memory): 0( 



n 2 V 5 (a ) 

— 75* — 



where m(a) = |T(o)|, m = ■max a £A'm(a) 1 77(a) =| A a \{a e i a | Ta = 0} |. 

2 4/ \ 

In the case of 3-level tree, the estimate of the operation number is: Q( - y ). 

Prospective generalizations of the problem may involve the following: (a) multicriteria descriptions 
of the elements, (b) more complicated structure (e.g., parallel-series graph), (c) uncertainty, and (d) 
dynamics. 



Problem 2.3 
(Fig. 29g) 



Problem 2.4 
(Fig. 29h) 



Problem 1.2 
(Fig. 29b) 




Problem 2.4 
(Fig. 29h) 



Problem 1.2 
(Fig. 29b) 



Problem 1.2 
(Fig. 29b) 




Problem 1.2 
(Fig. 29b) 



Fig. 32. 'Bottom-Up' solving scheme for tree-like case (1 23] 

3.2. Hotlink Assignment Problem 

In recent decade, the concept of "hotlinks" has been introduced for decreasing the complexity of access 
in web directories (or similar information structures) via usage (inserting) of a limited set of additional 
hyperlinks (i.e., "hotlinks") to data (e.g., [26].|53 ) . |171j V In general, "hotlink assignment problem" is a 
network upgrade problem (e.g., [59"]): 

Find additional new arc(s) to the initial graph in order to insert shortcuts and decrease the expected 
path length. 

Mainly, the problem is examined for trees. Let T = (A, E) be a directed tree with maximum degree d, 
rooted at a node r G A (elements of A correspond to Web sites, elements of E correspond to hyperlinks). 
A node weight equals its access (search) frequency (probability). It is assumed that required information 
is contained at leaf nodes (for simplicity). The length of the search for node v G A equals the number of 
links in the path from ro to v. 

Let T u = (A u , E u ) be a subtree of T (A u C A, E u C E), rooted at node u G A (here u is not the son 
of ro). Thus, additional direct link ("hotlink") will be as follows: (ro,u). In this case, a path to all leaf 
nodes in T u will be smaller. 

Fig. 33, Fig. 34, Fig. 35, and Fig. 36 illustrate the simplest versions of "hotlink assignment" problem. 
Note, Fig. 36 depicts the usage of internal nodes as additional roots (i.e., ri). 




Fig 33. Hotlink assignment problem (one hotlink) 
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Fig 34. Hotlink assignment problem (two hotlinks) 




Fig 35. Hotlink assignment problem (three hotlinks) 




Fig 36. Hotlink assignment problem (three hotlinks, one internal root) 



The basic "hotlink assignment" problem consists in assignment of k additional "hotlinks" (from the 
root) to minimize the total number of steps to visit the required information nodes. On the other 
hand, it is necessary to find the set of k nodes (U = {u}) at the tree T. In recent years, various 
"hotlink assignment" problems have been intensively studied (e.g., algorithm design, issues of complexity, 
approximation) (e.g., [26] , [46] , [52] , [53] , [69] , [104] , [IT7] , [119], [157], [T7T])- Some versions of the problem 
are presented in Table 8. Mainly, "hotlink assignment" problems belong to class of NP-hard problems 
(e.g., [104 ). Many approximation algorithms have been suggested for the problems (including FPTAS) 

(e.g., QIB], nm, E5ZD- 

In the case of multiple attribute description of "hotlinks" or/and selected nodes/subtrees (i.e., nodes 
as u), multicriteria knapsack like problems or multicriteria generalized assignment problems may be used 
(e.g., [138) , |139) , |150) . [170) 1. It may be reasonable to examine some generalizations of the "hotlink 
assignment" problem, for example: (i) taking into account uncertainty, (ii) tree-inclusion problem (as 
"hot-tree assignment" problem). Application of "hotlink assignment" problems is very useful for many 
domains, e.g., adaptive Web sites systems, knowledge bases, file systems, menus systems, asymmetric 
communication protocols (e.g., [26], g6J, QH], ES], [T7T] V 



Table 8. Hotlink assignment problems 



Problem 


Source 


1. Basic hotlink assignment 
2. Single hotlink assignment 

3. Hotlinks only for leafs 

4. Multiple hotlink assignment 

5. Dynamic hotlink assignment 


[26], (171) 
[33,1116] 

[53], [69] 
[52] 



Generally, "hotlink assignment" problem is a special case of "graph augmentation problem" (e.g., 



25 



[63 ] , [111] ). Here the goal is to modify an initial graph (e.g., by edges) such that the augmented graph 
will by satisfied some requirements (e.g., as increasing the connectivity). 

3.3. Scheme for Transformation of Tree to Steiner Tree 

Here a transformation of a tree T — (A,E) into Steiner tree S — (A',E') is considered as addition of 
Steiner points into an initial tree (or a preliminary built spanning tree) while taking into account the 
following: "cost" (required resource) of each Steiner point, "profit" of each Steiner point, total resource 
constraint (i.e., total "cost" of the selected Steiner points) [141] , A simplest case is considered when 
Steiner points for triangles are only examined. Evidently, vector-like "cost" and "profit" can be used as 
well. The solving scheme is the following: 

Stage 1. Identification (e.g., expert judgment, clustering) of m regions (clusters, groups of neighbor 
nodes) in the initial tree T for possible addition of Steiner points. 

Stage 2. Generation of possible Steiner points (candidates) and their attributes (i.e., cost of addition, 
"profit"). 

Stage 3. Formulation of multiple choice problem for selection of the best additional Steiner points 
while taking into account resource constraint (s): 



EE 



m qi q% 
"ij'Eij S.t. ^ ^ Q>ij%ij ^ %ij — 1, %ij G {0, 1}, 

i=l j = l j=l 



where i is the index of region (i = 1, m), qi is the number of versions for addition of Steiner points in 
region i = 1, m, j is the index of version for addition of Steiner points in region (j = 1, qi ), Xij is binary 
variable that equals 1 if version j in region i is selected, & is a total constraint for the required resources 
(i.e., a total "cost"). 

Stage 4- Solving the multiple choice problem to obtain the resultant Steiner tree S. 

A numerical illustrative example illustrates the scheme. Initial tree is (Fig. 37): T = (A,E), A = 
{1,2,3,4,5,6,7,8,9,10,11}. Four regions are defined (Fig. 37): region 1: {1, 2, 3, 4}; region 2: {4,6,7}; 
region 3: {4, 5, 6, 9, 11}; and region 4: {7,8,10}. The considered Steiner points are the following (Fig. 38): 
region 1: Sn,si2; region 2: s 2 i! region 3: s 31 ,s 32 ; and region 4: s 41 . Table 9 contains for multiple choice 
problem: binary variables and corresponding attributes (required resource as "cost" , possible "profit" ) . 
Thus, the problem is: 



4 qi A qi qi 

max CijXij s.t. < b, }^ Xjj = 1, iy e {0, 1}. 

i—l i—1 i—1 j — 1 j — 1 
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Steiner points Steiner points Stciner points 

for regions 2, 4 for regions 1, 3 for regions 1, 3 




Fig. 38. Additional Steiner points for regions 



Table 9. Data for multiple choice problem 



Region 


Binary 


Steiner 


"Cost" 


"Profit" 




variable 


point 


Cij 




Region 1 


x n 


None 


0.0 


0.0 




212 


Sll 


3.1 


1.5 




Xl3 


S12 


1.2 


1.4 


Region 2 


%21 


None 


0.0 


0.0 




X22 


S21 


2.0 


1.3 


Region 3 


Z31 


None 


0.0 


0.0 




^32 


S31 


2.4 


1.4 




£33 


S32 


1.8 


1.3 


Region 4 


X41 


None 


0.0 


0.0 




^42 


S41 


1.5 


1.2 



Some obtained solutions (i.e., as set of additional Steiner points) are the following (a simple greedy 
heuristic was used) (Fig. 39): 

(1) bi = 2.9: x bl : x\ 2 = 1, x 2 \ = 1, x 32 = 1, £41 = 1, Steiner points Z bl = {sn,s 3 i}, 
total (additive) "profit" c = 5.5; 

(2) 6 2 = 4.2: x 62 : x 12 = 1, x 22 = 1, £32 = 1, ^42 = 0; Steiner points Z b2 = {s n , s 2 i, «3i}i 
total (additive) "profit" c = 7.5; 

(3) 63 = 5.4: x b3 : x i2 = 1, x 22 = 1, x 32 = 1, x 42 = 1, Steiner points Z bs = {s n , s 2 i, S31, s 4 i}, 
total (additive) "profit" c = 9.0. 

Note, more complicated problem can be examined, for example, while taking into account the following: 
(i) several additional Steiner points in the same region, (ii) compatibility of points additions in neighbor 
regions. 




• • • 



Fig. 39. Solutions 
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3.4. Towards Restructuring Problems 

In recent several years, a special class of combinatorial optimization problems as "reoptimization" has 
been intensively studied for several well-known problems: (a) minimum spanning tree problem [25) . (b) 
traveling salesman problems (|S], [TO]), (c) Steiner tree problems (d) covering problems [2"0]. 

In general, the reoptimization problem is formulated as follows: 

Given: (i) an instance of the combinatorial problem over a graph and corresponding optimal solution, 
(ii) some "small" perturbations (i.e., modifications) on this instance (e.g., node-insertion, node-deletion). 

Question: Is it possible to compute a new good (optimal or near-optimal) solution subject to minor 
modifications ? 



A survey of complexity issues for reoptimization problems is presented in |24j . Mainly, the problems 
belong to class of NP-hard problems and various approximation algorithms have been suggested 

In [129j , the author has suggested another approach to modification in combinatorial optimization prob- 
lems as "restructuring". The approach corresponds to many applied reengineering (redesign) problems 
in existing modular systems. The restructuring process is illustrated in Fig. 40 [129] . Here modifications 
are based on insertion/deletion of elements (i.e., elements, nodes, arcs) and changes of a structure as well. 
Two main features of the restructuring process are examined: (i) a cost of the initial problem solution 
restructuring (i.e., cost of the selected modifications), (ii) a closeness the obtained restructured solution 
to a goal solution. 



Requirements 
(for n) 




Restructuring: 

S 1 => S* 
while taking 
into account: 

(i) S* is close 
to S 2 , 

(ii) change of S 1 
into S* is cheap. 



Requirements 
(for 7*0 




Solution S 2 ) 



on t 2 t 

Fig. 40. Illustration for restructuring process |129j 

This kind of problems corresponds to redesign/reconfiguration (improvement, upgrade) of modular 
systems and the situations can be faced in complex software, algorithm systems, communication networks, 
computer networks, information systems, manufacturing systems, constructions, etc. (e.g, [127] . |129j ). 
The optimization problem is solved for two time moments: n and 72 to obtain corresponding solutions S 1 
and S 2 . The examined restructuring problem consists in a "cheap" transformation (change) of solution 

5 1 to a solution S* that is very close to S 2 . In |129j . this restructuring approach is described and 
illustrated for the following combinatorial optimization problems: knapsack problem, multiple choice 
problem, assignment problem, spanning tree problems. Evidently, this restructuring problem may be 
used for many combinatorial optimization problems as changing a solution (e.g., subset, structure). Fig. 
41 depicts the restructuring problem |129j . 

Let P be a combinatorial optimization problem with a solution as structure S (i.e., subset, graph), 
Q, be initial data (elements, element parameters, etc.), f(P) be objective function(s). Thus, S(SV) be a 
solution for initial data ft, f(S(fl)) be the corresponding objective function. Let fi 1 be initial data at an 
initial stage, /(5 , (fi 1 )) be the corresponding objective function, ft 2 be initial data at next stage, f(S(fl 2 )) 
be the corresponding objective function. 

As a result, the following solutions can be considered: (a) S 1 — S^fi 1 ) with /(S^fi 1 )) and (b) 

5 2 = S(fl 2 ) with f(S(ft 2 )). In addition it is reasonable to examine a cost of changing a solution into 
another one: H(S a —> Let p(S a ,S^) be a proximity between solutions S a and S^, for example, 
p(S a ,S@) — \f(S a ) — f(S^)\. Note, function f(S) is often a vector function. Finally, the restructuring 
problem can be examine as follows (a basic version): 



28 



Find a solution S* while taking into account the following: 
(i) HIS 1 -> S*) — > min, (ii) p(S*, S 2 ) — > min (or constraint). 

Thus, the basic optimization model can be examined as the following: 

min p(S*,S 2 ) s.t. HiS 1 -> S*) <h, 

where ft, is a constraint for cost of the solution change. 

Proximity function p(S* , S 2 ) can be considered as a vector function (analogically for the solution 
change cost). The situation will lead to a multicriteria restructuring problem (i.e., searching for a Pareto- 
efficient solutions). 



"Quality" 



Obtained 
solution S 




Solution 
change cost 
H^S 1 -> S*\ 



S 



Goal 
solution 
(i = r 2 ): S 2 



Neighborhoods 
Proximity ? ^2 

O' Initial P(S*,S 2 ) 01 
\ solution 

(t = T 1 ) 



Fig. 41. Illustration for restructuring problem |129] 



4. CONCLUSION 

The paper contains a generalized integrated glance to design of system hierarchies. The problems of 
this kind are often crucial ones in system analysis, in systems design. The following basic hierarchical 
structures are considered: trees, a special morphological hierarchy, multi-layer structures. Evidently, the 
examination is based on combinatorial optimization models. The list of design methods involves the 
following: expert-based 'top-down' procedure ('divisive' strategy of hierarchical clustering), hierarchical 
clustering (agglomerative algorithm), spanning trees, ontological approach, 'optimal' organization, multi- 
layer structures (including multilevel network design problem). 

A special attention is targeted to modification problems. Here, a simplified modification of trees is 
examined (e.g., augmentation problem for graphs is not considered [53]). The presented modification 
problems are as follows: modification of a weighted tree via condensing neighbor nodes (i.e., design of 
over- lay structure of tree-like software) , hotlink assignment problem, building of a Steiner tree based on 
an initial tree, new class of restructuring problems. 

Generally, our material is the first integrated attempt to the problem. As a result, many considered 
issues require additional studies and analysis of many applied examples, many other significant problems 
have to be described and studied. For example, issues of uncertainty have not been examined. Mainly, 
considered problems are described on the basis of a simplified frames (e.g., engineering description, 
problem formulation, basic solving schemes, prospective versions). 

The presented material is oriented to needs of applications. In the future, it may be interesting to 
use the described approaches in education (engineering, management, computer science and information 
technology, applied mathematics). This material can be considered and used as a tutorial. 
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